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Abstract 

We consider parabolic stochastic partial differential equations driven by white 
noise in time. We prove exponential convergence of the transition probabilities 
towards a unique invariant measure under suitable conditions. These conditions 
amount essentially to the fact that the equation transmits the noise to all its de- 
termining modes. Several examples are investigated, including some where the 
noise does not act on every determining mode directly. 
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1 Introduction 

We are interested in the study of long-time asymptotics for parabolic stochastic 
partial differential equations. More precisely, the existence, uniqueness, and speed 
of convergence towards the invariant measure for such systems is investigated. The 
general setting is that of a stochastic PDE of the form 

dx = Ax dt + Fix) dt + Q duj(t) , aj(0) = x , (1.1) 

where x belongs to some Hilbert space H, A is the generator of a Co-semigroup 
on 7i, F : H — > TL is some nonlinearity, uj is the cylindrical Wiener process on 
some other Hilbert space W, and Q : W — > H is a bounded operator. If the nonlin- 
earity F is sufficiently "nice", there exists a unique solution x(t) to ( |l.l| ) (see e.g. 
[ DPZ92b| ]). In this paper, we investigate the asymptotic stability of ( |1.1[ ). We say 



that the solutions of are asymptotically stable if there exists a unique proba- 
bility measure on W such that the laws of x(i) converge to independently of 
the initial condition xo- We are interested in the situation where the asymptotic sta- 
bility is a consequence of the noise (i.e. the deterministic equation x = Ax + F(x) 
is not asymptotically stable in the above sense), although the noise is weak, in the 
sense that the range of Q in H is "small". 



The investigation of asymptotic behaviour for solutions of (1.1) goes back to 



the early eighties (see for example [ [VIS99 ] for an excellent review article or the 



monograph JDPZ96 ] for a detailed exposition). Until recently, two approaches 



dominated the literature on this subject. For the first approach, sometimes called 
the "dissipativity method", one considers two solutions x(t) and y(t) of ( |1.1[ ), corre- 
sponding to the same realization of the Wiener process uj, but with different initial 
conditions xq and y^. HA and F are sufficiently dissipative, \\x(t) — y(t)\\ con- 
verges to for large times in some suitable sense. If this convergence is sufficiently 
fast and uniform, it yields asymptotic stability results (see for example [ DPZ92a| ]). 



Closely related to this approach are the Lyapunov function techniques, developed 
for semilinear equations in [ jch84 ]. The dissipativity method, as well as the Lya- 



punov function techniques, are limited by the requirement that the deterministic 
equation x = Ax + F(x) already shows stable behaviour. 

The (linearly) unstable situations are covered by the second approach, to which 
we refer as the "overlap method". It consists in showing that the Markov transition 



semigroup associated to (1.1) has the strong Feller property and is topologically 



irreducible. Then, provided that the equation (1.1) shows some dissipativity, ar 



guments as developed in the monograph [ ]MT94| |, allow to bound the overlap be- 
tween transition probabilities starting at two different initial points. This in turn 
yields strong asymptotic stability properties. The main technical difficulty of this 
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approach is to show that the strong Feller property holds. This difficulty is usu- 
ally mastered either by studying the infinite-dimensional backward Kolmogorov 
equation associated to ( |1.1[ ) [ pPZ91 ], or by showing that the Markov transition 
semigroup has good smoothing properties [ DPEZ95 , |Cer99j ]. This technique is 
limited by the requirement that the noise be sufficiently non-degenerate. A typical 
requirement is that the range of Q contains the domain of some positive power 
of —A. To our knowledge, only one work J5H01 , HaiOl ] shows the strong Feller 
property for a stochastic PDE in a situation where the range of Q is not dense in Ti 
(but still of finite codimension). 

Very recently, a third approach, to which we refer as the "coupling method", 



emerged in a series of papers on the 2D Navier-Stokes equation. (See QKS01 



MatOl, MY01[ ] and the references in Section |6[) The main idea of these papers 
is to make a splitting TL = TLl © TLh of the dynamics into a finite-dimensional, 
linearly unstable, low-frequency part TLl and a remaining infinite-dimensional sta- 
ble part TLh- An important assumption on Q is then that the range of Q contains 
TLl- The space TLl is chosen in such a way that the long-time asymptotics of 
the dynamics is completely dominated by the behaviour of the low-frequency part. 
More precisely, for any given realization XL(t) of the low-frequency part, the dy- 
namics of the high-frequency part xn(t) will loose memory of its initial condition 
exponentially fast. On the low-frequency part, in turn, the noise acts in a non- 
degenerate way. A clever coupling argument allows to combine these two facts 
in order to obtain asymptotic stability results. The argument consists in coupling 
two realizations of ( |L~I| ) in such a way that if the low-frequency parts meet at some 
time r, they remain equal for all times t > r. (Of course, one has to show that r is 
finite with probability 1.) In fact, this coupling method is very close to the Gibb- 
sian approach developed in [ KS00| , BKLOOb, EMS01], which consisted in trans- 
forming the infinite-dimensional Markovian system on TL to a finite-dimensional 
non-Markovian system on TLl- This finite-dimensional system was shown to have 
exponentially decaying memory and thus techniques from statistical mechanics can 
be applied. 

Loosely speaking, the coupling method combines the arguments of both the 
dissipativity method (on TLh) and the overlap method (on TLl)- The coupling 
method thus yields a very powerful approach to the problem of asymptotic stabil- 
ity of ( \l.l\ >. The conditions of applicability of this coupling method have been suc- 
cessively weakened in the aforementioned papers, but the existing results always 
require, as we already mentioned, that the noise acts directly and independently 
on every determining mode of the equation. In this paper, we extend the coupling 
method to problems which do not satisfy this condition. Our overall approach is 
similar to the one exposed by Mattingly in [ MatOlp , and consequently some of our 
proofs are closely related to the arguments exposed there. Our main new idea is to 
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construct a coupling for which the low-frequency parts of the dynamics do not ac- 
tually meet at some finite time, but converge exponentially fast towards each other. 
This "asymptotic coupling" is achieved through a binding construction exposed in 
Section [2.3[ , which seems to be new and can in some cases be implemented even in 
very degenerate situations. 

In the following section, we illustrate the method of asymptotic coupling for a 
simple finite dimensional problem. 

1.1 A toy model 

Consider the following system of stochastic differential equations in R 2 : 



dx\ = (2xi + X2 
d,X2 = (2X2 + xi 



xf) dt + du(t) , 
x\) dt . 



(1.2) 



This equation should be interpreted in the integral sense, with u> E U a Brownian 



motion. Applying Hormander's condition [Hor85, Nor86], it is easy to see that the 



transition probabilities of (1.2) are smooth with respect to the Lebesgue measure 
on R 2 . Furthermore, an easy controllability argument shows that they have support 



everywhere and therefore are all mutually equivalent. Since (1.2) also exhibits a 
strong drift towards the center of the phase space at large amplitudes, it follows 



by standard arguments that ( \l.2\ > possesses a unique invariant measure [i* and that 
every initial condition is exponentially (in variation norm) attracted by /i*. 

The problem with this argument is that it heavily relies on the existence of 
some reference measure (in this case the Lebesgue measure) which is equivalent 
to the transition probabilities. In the infinite-dimensional setting, such a reference 
measure will usually not exist when the noise is sufficiently degenerate. (For an 
account of some cases where such a reference measure does exist in the infinite- 
dimensional case, see [ MS99| , EH01].) Furthermore, the fact that both directions 
in ( |Q| ) are linearly unstable prevents one from applying the coupling method as it 
is presented in the previous section. 

We will show that the invariant measure for ( |P| ) is unique, using a coupling 
construction which pushes solutions together at an exponential rate. This construc- 
tion is asymptotic, compared to more conventional coupling constructions, which 
look for hitting times at which the coupled dynamics actually meets. 

Before we proceed, let us explain briefly what we mean by "coupling". A 
coupling for (L2) is a process (x(t), y(t)) G R 2 xR 2 , whose marginals x(t) and y(t) 
taken separately are both solutions of ( |I~2| ) (but with different initial conditions). 
In general, one takes a measure P on (1 x fl, whose marginals are both equal to 



the Wiener measure W. Then a coupling for (1.2) can be constructed by drawing a 
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pair (oj, ui) G 0, x Q distributed according to P and solving the equations 

dxi = (2xi + x 2 - x\) dt + dw(t) , dy x = (2yi + y 2 - yf) dt + du(t) , 
dx 2 = (2x 2 +xi- x\) dt , dy 2 = (2y 2 + y\ - y\) dt . (1.3) 

We will carefully choose the measure P in such a way that the quantity \\x — y\\ 
converges exponentially to for large times. This then yields the uniqueness of the 
invariant measure for (|L2|). 

Our main idea leading to the construction of P is to consider the following 
system in R 4 : 

dx\ = (2x\ + x 2 — x\) dt + duj(t) , 

dx 2 = (2x 2 + xi - x\) dt , ^ 
dy 1 = (2y 1 + y 2 - yf) dt + du(t) + G(x\, x 2 , yt, y 2 ) dt , 
dy 2 = (2y 2 + y\- yl) dt , 

where dm denotes twice the same realization of the Wiener process. We see that 



this equation is the same as (1.3) with oj defined by 



u(t) = oj(t) + f G(xi(s), x 2 (s), yi (s), y 2 (s)) ds . (1.5) 
Jo 

The noise oj G Q, is distributed according to some measure VV which is in general 



not equal to the Wiener measure W. Therefore, < jjL4 ) does not yet define a coupling 



for (Oh. If G is small in the sense that the quantity 



\\G(x 1 (s),x 2 (s),y 1 (s),y 2 (s))fds (1.6) 

o 

is bounded with sufficiently high probability, then the measures VV and W are 
equivalent. In this case, it is possible to construct a measure P on O x H whose 
marginals are W, with the important property that there exists a random time r 
with P(t < oo) = 1 such that the solutions of the coupled system satisfy ( |L4| ) for 
times t > t. 

In view of the above, we have reduced the problem to finding a function G 
such that the solutions of (L4) satisfy \\y(t) — x(t)\\ — ► for t — ► oo and ( |L6| ) is 



bounded. We introduce the difference process g = y — x, and we write 

gi = 2gi + g 2 - gi {x\ + x\y\ + y\) + G(x, y) , §L%) 
= 2g 2 + g x - g 2 {x 2 2 + x 2 y 2 + yl) . ([D|b) 
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It is easy to find a function G such that q\ — > 0, but this does not yet mean that Q2 



will go to zero. A closer look at (1.7 b) shows that if we could force qi to be very 



close to —3^2. ( |l-7| b) could be written as 

Q2 = ~Q2 + £ - Q2{xl + X 2 V2 + , 
which is asymptotically stable. Introduce the function £ = qi + 3f?2- We then have 

C = (•• •) + G(x 1 ,x 2 ,yi,y2) , 

with (. . .) an expression of the order ||£>||(1 + ||x|| 2 + ||y|| 2 ). Now we can of course 
choose G = — (. . .) — 2£. This way, the equation for Q becomes £ = — 2£ and we 
have the solution ((t) = ((0)e~ 2t . Plugging this into (|L~7|b), we get 

Q2 = ~Q2 + C(0)e -2 * - Q 2 {xl + X 2 V2 + yl) ■ 

We thus have the estimate 

|02(*)| < |^(0)|e-' + |C(0)|e- 2t . 

Finally, qi is estimated by using the definition of Q and we get 

\Qi{t)\ < 1^2(0)16"* + 4|C(0)|e~ 2t . 

This shows that, with G chosen this way, there exists a constant C such that 

\\x(t) - y(t)\\ <C||x(0)-y(0)|| e -' , 

for almost every realization of the noise. Since typical realizations of x(t) do not 
grow faster than linearly, G is also of the order e~*, with at most a polynomial 
factor in t multiplying the exponential. The main result of this paper, Theorem |4.1[ , 
shows that the above construction implies the existence and uniqueness of an in- 
variant probability measure fi* for the problem at hand. Furthermore, it shows that 
the transition probabilities converge exponentially fast towards /i* in the Wasser- 
stein norm (the dual norm to the Lipschitz norm on functions). 

This concludes our presentation of the toy model. For a more precise statement, 
the reader is encouraged to actually check that the above construction allows to 
verify the assumptions stated in Section ||. 

The remainder of this paper is organized as follows. In Section ^, we give 
the precise definitions for the type of coupling we will consider. In Section ^j, we 
state the properties of the coupling that are required for our purpose. In Section ^ 
we prove the abstract formulation of our main ergodic theorem. In Section |5], this 
abstract theorem is then specialized to the case of stochastic differential equations. 
In Section |6| finally, we present several examples where our construction applies, 
although the noise does not act directly on every determining mode of the equation. 
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2 The Coupling Construction 

In this section, we explain our coupling construction. Before we start with the 
actual definitions of the various objects appearing in the construction, we fix our 
notations. 

2.1 Notations 

If [i is a measure on a measurable space X (in the sequel, we will always consider 
Polish^] spaces) and / : X — > Y is a measurable map, we denote by f*fi the measure 
on Y defined by (f*fj,)(A) = /i(/ -1 (.A)). For example, if II is a projection on one 
component of a product space, LF/i denotes the marginal of fj, on this component. 
If a natural reference measure is given on the current space, we denote by Vfi the 
density of fi with respect to the reference measure. 

We define for any two measures /j and v the measures \i A v and \i\v. If a 
common reference measure is given, these operations act on densities like 

(V(p, A v))(x) = mm{Vn{x) , Vv(x)} , 
(T>(ji \ v)){x) = max{XV(x) - T>v(x), 0} . 

It immediately follows that /j = (fj, A v) + (pi \ v) for any two measures \i and v. 
We will use the equivalent notations \i<v and v > \x to say that fj, A v = fj, holds. 
One can check the following relations: 

/* 0i A v) < /> A f*u , 

/V\^)>/V\/v. 

Equalities hold if / is injective. 

For a given topological space X, we denote by M(X) the space of all finite 
signed Borel measures on X. We denote by Mi(X) the set of all probability mea- 
sures on X. For \i € M(X), we denote by its total variation norm (which is 
simply its mass if /x has a sign). 

i.e. complete, separable, and metric 
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2.2 Definition of coupling 

In this section, and until the end of the paper, we will often consider families of 
measures indexed by elements y G Y, with Y some Polish space. One should think 
of y as the initial condition of a Markov chain on Y and of Q y either as its transition 
probabilities, or as the measure on pathspace obtained by starting from y. We will 
always assume that the functions y i— > Q y (A) are measurable for every Borel set 
A. If Q y is a family of measures on Y ra and K y is a family of measures on Y m , a 
family of measures (RQ)y on Y n+m = Y ra x Y m can be defined on cylindrical sets 
in a natural way by 

(RQ) y (AxB) = J^R Zn (B)Q y (dz) , (2.1) 

where A C Y n , B C Y m , and z n denotes the nth component of z. 

We consider a discrete-time Markovian random dynamical system (RDS) $ on 
a Polish space X with the following structure. There exists a "one-step" probability 
space (Q, F, P) and is considered as a jointly measurable map $ : (X, S7) — > X. 
The iterated maps $ n : (X, Q, n ) — > X with n G N are constructed recursively by 

<S> n (x, U!,...,U n ) = $($ n_1 (x, Wl, . . . , W„_l), w n ) , 

This construction gives rise to a Markov chain on X (also denoted by <J>) with one- 
step transition probabilities 

P x = $(x,)*P . 

The n-step transition probabilities will be denoted by P™. Our main object of study 
will be the family of measures on pathspace generated by $. Take a sequence 
Wi}i^o an d an initial condition x G X. We then define xq = x and Xj + i = 
<fr(xj, Wj). We will denote by P" with n G N U {oo} the measure on X n obtained 
by transporting P" with the map {uji} i— > {xj}. It is also natural to view P" as a 
measure on X n x O ra by transporting P n with the map {iOi} i— > {xj, cjj}, so we will 
use both interpretations. 

Remark 2.1 The above setup is designed for the study of stochastic differential 
equations driven by additive noise. In that case, Q, is some Wiener space and $ 
maps an initial condition and a realization of the Wiener process on the solution 
after time 1. Nevertheless, our setup covers much more general cases. 

The coupling needs two copies of the pathspace, i.e. we will consider elements 
(x, y) G X°° x X°°. It will be convenient to use several projectors from X N x X^ 
to its components. We define therefore (for n < N): 



U 1 :{x,y)^x, U 2 :{x,y) i-> y , ir n : (x, y) h-> (x n , y n ) 
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We also define m ;n = LTj o 7r n for i € {1, 2}. 

Definition 2.2 Let Q be a Markov chain on a Polish space X and let be the 

associated family of measures on the pathspace X°°. A coupling for 3> is a family 
C™y of probability measures on X°° x X°° satisfying 

HiG^y = P^ and n 2 C^ y = , 
where II i and II2 are defined as above. 

A trivial example of coupling is given by CSj!°„ = P£° x P^°. The interest of 
constructing a non-trivial coupling comes from the following observation. Take 
some suitable set of test functions Q on X and define a norm on M(X) by 

IHb = su p (9^) ■ 

gee 

Once the existence of an invariant measure for the Markov chain <3? is established, 
one usually wishes to show its uniqueness by proving that $ forgets about its past 
sufficiently fast, i.e. 

lim II P" - P" lie = , for all (x, y) e X 2 , 

n — >oo " 

with suitable bounds on the convergence rate as a function of the initial conditions. 
Now take a coupling C^ y for <£. It is straightforward to see that by definition the 
equality 

(P2,9)= [ 0(*)«nC»)(d*) 
JXxX y 

holds, as well as the same equality where 7Ti n is replaced by 7T2 jn and P™ is re- 
placed by Py. Therefore, one can write 

II P" - P;ib = sup / (g(U lZ ) - g(U 2 z)) «C« y ){dz) . (2.2) 

This equation is interesting, because it is in many cases possible to construct a 
coupling C^y such that for n large, the measure 7r*C£° y is concentrated near the 
diagonal Uiz = IL2Z, thus providing through (2J2) an estimate for the term ||P™ — 
P™||g. This is precisely what was shown in our toy model of Section LI, where we 
constructed / in such a way that \\x(t) — y(t)\\ — > for t — > 00. 
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2.3 The binding construction 

In this subsection, we describe a specific type of coupling for a given RDS <£. Only 
couplings of that type will be taken under consideration in the sequel. 

Let $ and the associated probability space ($7, F, P) be as above. We consider 
a family rp x ^ y : O — > $7 (the pair (x, y) belongs to X 2 ) of measurable functions 
that also have measurable inverses. We will call these functions binding functions 
for <I>. The reason for this terminology is that, given a realization {w„}™ =0 of the 
noise and a pair of initial conditions (xo, yo) £ X 2 , the binding functions allow us 
to construct two paths {x n } and {y n } by setting 

Wn = ^„^j„(Wn) , x n+ i = $(x n , u n ) , y n+ i = <£>(?/„, u) n ) . (2.3) 

Our aim is to find a family ipx-,y such that y n converges towards x n in a suitable 
sense for large values of n. Thus, the binding functions play the role of a spring 



between x and y. We will say that ( |2.3| ) is a binding construction for We denote 
the inverse of i^ x -,y by ipx^y The reason behind this notation should be clear from 
the diagram below. 



x n Un 

$( • , U n ) -< 0J n - t OJn >■ $( • , & n ) 

^Xn^y-n 

X n +l Un+1 



(2.4) 



The solid arrows denote the various maps and the dashed arrows denote the influ- 
ences of the appearing quantities on those maps. It shows that it is also possible 
to achieve the binding construction by first choosing a sequence {w n }™ =0 and then 
using ip Xn ^y n to construct the uj n , thus obtaining the same set of possible realiza- 



tions for (x n , y n ). This symmetry between ip x -*y an d ipx^y is also explicit in (2.6) 
below. 

Guided by the above construction, we use the binding maps to construct a cou- 
pling Markov chain $onXxX with transition probabilities C XjV in the following 
way. Define the maps 



^>x-,y :fl->fixfi ^x^y ■ H — > ^ X ^ 

CO l-> (uj,ip x ^ y (uj)) , UJ (-> (^^-kM,^) . 



(2.5) 



Notice that, up to some null set, the image of both maps is the set {(oj,lo) \ uj = 
ipx^yi^)}- Then we define a family of measures P on Q, x Q, by 

P^v = (*:^,P) A (**^P) = **^(P A C^P) • (2.6) 
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According to (2.4), the measure P x Vn is precisely the common part between the 
measure obtained for (u n ,u n ) by distributing uj n according to P and the one ob- 
tained by distributing u> n according to P. Thus both marginals of the measure P xy 
are smaller (in the sense of Section |Q| ) than P. In order to have a non-trivial con- 
struction, we impose that the measures P and tp x ^ y P are equivalent. The density 
of 4>x->yP relative to P will be denoted by V Xty (u). 



Considering again ( |2.4| ), the family of measures P xy is transported on X x X 
by defining 

$ XtV :l]xfi^XxX 

(w,w) i-> ($(x,uj),<5>(y,<Jj)) , 

and setting 

Qx,y = K,yP X ,y • ^-J) 

But this does not give a transition probability function yet, since the measures P xy 
are not normalized to 1. We therefore define the family of measures P x by 

K,y = ? X ,y + (P \ n*lP X> y) * (P \ ^y) > 

where the number c XtV is chosen in such a way that the resulting measure is a 
probability measure. By a slight abuse of notation, we used here the symbol Ilj 
to denote the projection on the ith component of $7 x Q. As a matter of fact, 
(P \ W[P xy ) and (P \ IIIPjj. y ) have the same mass, which is equal to 1 — HP^ J|, 
so 

1 



x ' y HP\n^P^||' 

for example. (Recall that the symbol || • || stands for the total variation norm, which 
is simply equal to its mass for a positive measure.) It is straightforward to show 
that the following holds: 

Lemma 2.3 The measures P x y satisfy n*P x . = P for i = 1,2. 
Proof. It is clear by (^) that U*P x y < P. Thus 



njp^ = uiP ttV + c x JP \ U* 2 P X J(P \ UlP x>y ) 
= (PAUlP x J + (P\UlP Xty ) = P, 



(2.8) 

and similarly for LT^Pj, y . □ 
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This finally allows us to define the transition probabilities for by 

C x , y = K,yK,y^Qx,y+R X ,y. (2.9) 

In this expression, the only feature of R x ,y we will us e is that it is a positive mea- 
sure. We define as the measure on the pathspace X°° x X°° obtained by 



iterating (2.1). Since Hi o Q xy = <fr(x, •) o Ii l and similarly for n 2 , it is straight- 
forward to verify, using Lemma 23, that the measure C^ y constructed this way is 



indeed a coupling for <3>. 

For a given step of 'J, we say that the trajectories do couple if the step is drawn 
according to Q x , y and that they don't couple otherwise. 

Remark 2.4 Since P xy is a family of measures on Q x Q, it is also possible to 
interpret C™ y as a family of probability measures on X n x X n x n n x Q n . We will 
sometimes use this viewpoint in the following section. It is especially useful when 
the RDS is obtained by sampling a continuous-time process. 



Remark 2.5 It will sometimes be useful to have an explicit way of telling whether 
a step of ^ is taken according to Q%? y or according to R^ y (i.e. whether the trajec- 
tories couple or not). To this end, we introduce a Markov chain $ on the augmented 
phase space X x X x {0, 1} with transition probabilities 

P x,y = Qx,y X 5i + R X) y X 6 . 

The marginal of ^ on X x X is of course equal to \&. By a slight abuse of notation, 
we will also write C™ y for the probability measure on pathspace induced by ^. 

It will be useful in the sequel to have a map that "transports" the family of maps 
ipx^y on ^ n v i a th e RDS <£. More precisely, fix a pair (x, y) G X x X of starting 
points and a sequence (u>o, . . . ,ui n ) of realizations of the noise. We then define 
xo = x, yo = y, and, recursively for i = 0, . . . , n 

x i+ i = $(xi,u}i) , y i+ i = $(yi,ipxi^ yi (ui)) ■ 

This allows us to define the family of maps E" : £l n — > Q, n by 

^"Vo, • • • , W„) i-> (ipxo^yoiuo), ^ n ^ n (Wn)) • (2.10) 

Since ip x ^ y P is equivalent to P, we see that (E xy )*P n is equivalent to P n and 
we denote its density by T> xy . We also notice that the family of measures Q™ 
is obtained by transporting (S™ y )*P ra A P n onto X ra x X n with the maps <& Xl ,y l 
^xi-^i- In particular, one has the equality 

llQ^II = ll(^/P n APl= / (1AP>))P». (2.11) 
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3 Assumptions on the Coupling 

In this section, we investigate the properties of the coupling C^ y constructed in the 
previous section. We give a set of assumptions on the binding functions ip x ^ y that 
ensure the existence and uniqueness of the invariant measure for 3>. 

In order to achieve this, we want the map tp x ^ y to modify the noise in such a 
way that trajectories drawn according to tend to come closer together. This 



will be the content of Assumption A3. Furthermore, we want to know that this 



actually happens, so the noise should not be modified too much. This will be the 



content of assumptions \4 and 45. All these nice properties usually hold only in a 



"good" region of the phase space. Assumptions 41 and |A^ will ensure that visits 
to this good region happen sufficiently often. 

3.1 Lyapunov structure 

Since we are interested in obtaining exponential mixing, we need assumptions of 
exponential nature. Our first assumption concerns the global aspects of the dynam- 
ics. It postulates that <3? is attracted exponentially fast towards a "good" region of 
its state space. We achieve this by assuming the existence of a Lyapunov function 
for 



Definition 3.1 Let Q by a RDS with state space X as before. A Lyapunov function 
for <I> is a function V : X — > [0, oo] for which there exist constants a S (0, 1) and 
b > 0, such that 

[ V(*(aj,w)) P(dw) < aV(x) + b, (3.1) 
Jn 

for every x G X with V(x) < oo. 

Our first assumption then reads 
Al There exist a Lyapunov function V for <£. Furthermore, V is such that 

P{uj | V($(x,u)) < oo} = 1 , 

for every x £ X. 

For convenience, we also introduce the function V : X x X — > [0, oo] defined 

by 

V(x,y) = V(x) + V(y) . 

Notice that V is a Lyapunov function for ^ by construction. 

In some cases, when the control over the densities V Xj y is uniform enough or 
when the phase space is already compact (or bounded), a Lyapunov function is not 
needed. In such a situation, one can simply choose V = 1. 
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In our case of interest, the RDS <J> is obtained by sampling a continuous-time 
process & t at discrete times. In that setting, it is useful to have means to control 
excursions to large amplitudes that take place between two successive sampling 
times. To this end, we introduce a function W : X x Q — > [0, oo] given by 

W{x,uj)= sup V($ t (x,u)) 
te[0,l] 

in the continuous-time setting and by 

W(x,u) = V(x) 

in the discrete-time setting. In fact, any other choice of W is all right, as long as it 
satisfies the properties that are summarized in Assumption A2 below. 

Before stating these properties, we define two other functions that act on pairs 
of initial conditions that couple by 

We will assume that W and the binding functions are such that W, W x ^ y and 
W x ^ y do not behave much worse than V. More precisely, we will assume that: 

A2 There exists a function W : X x — > [0, oo] swc/j f/za? 



ess inf M^(x, w) = V(x) , (3.3a) 

uGfi 

/ W(x, u) ?{duS) < c V(x) , (Ob) 
Jn 

for some constant c > 0. Furthermore, there exist constants C > 5 > 1 
such that the estimates 

W x _ y {uj) < C(l + F(y) + W(x, lo)) S , 
W x ^ y (u) < C(l + V(x) + W(y, Lo) f , 



hold for the functions defined in (p.2|). 

The Lyapunov structure given by assumptions ^l| and |A^ ensures that W (and 
thus also V) does not increase too fast along a typical trajectory. In order to make 
this statement precise, we define for a given initial condition x G X the sets A x k C 
0°° by 

A Xjk = {u en°° \ W($ n (x,uj),uj n ) < kV(x) + kn 2 Vn > 0} , (3.5) 

where k is some positive constant. The sets A x k contain almost every typical 
realization of the noise: 
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Lemma 3.2 Let § be a RDS satisfying assumptions \1 and A2 Then, there exists 
a constant C > such that 

P°°(A Xik )>l-j, 
for every x € X and every k > 0. 



Proof. For u G we define x n = & n (x,uj). Notice that by (fO|b) and the 
Lyapunov structure, one has the estimate 

be 

E(W(x n , io n+1 )) < ca n V(x) + T — , (3.6) 
where E denotes expectations with respect to P°°. We also notice that A x ^ = 

n„>o4Swith 

A x)t = {°° I W{x n ,uj n+1 ) < W{x) + kn 2 } . 
Combining this with (f3l|), we see that 



P°°(A<ri) > 1 - C ""^) + ^ 1 - Q )" 1 
1 x ' k) ~ k V(x) + n 2 

Therefore, the worst possible estimate for P°°(A Xj k) is 

poo,, \ \ i c^ aWW + bd-a)- 1 

n=l 

which proves the claim. □ 
3.2 Binding property 

The crucial property of the coupling is to bring trajectories closer together. In order 
to make this statement more precise, we introduce the Lipschitz norm || • || L defined 
on functions g : X — ► R by 

n ii i / m , \d( x ) - 9(y)\ 

\\g\k = SU P \9(x)\ + sup — — , 

where d(-, •) denotes the distance in X. The dual norm on M(X) is then given by 

IIHIl = su p / g(x)n(dx) . 

With this definition at hand, we make the following assumption on the coupling 
part Q~ y . 
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A3 There exist a positive constant 71 and a family of constants K 1— > Ck such 
that, for every K > 0, 

Iki.nQ^sz ~ ^nQS/IL < C K e~ lin , (3.7) 
/ioWs when V(x, y) < K. 



Remark 3.3 The sub-probability kernels are smaller than the transition prob- 
abilities for the binding construction (23). Thus, (3/7) is implied by an inequality 
of the type 

E(d(x n ,y n )) < CV(x ,y )e-^ n , 

where d denotes the distance in X and E denotes the expectation with respect to the 
construction (EOl). 



Notice that this assumption is non-trivial only if our coupling is such that 
> for sufficiently many starting points. This will be ensured by the 



Q 



next assumption. 



00 

x,y\ 



A4 Let T)xy be defined as in Section |2.J[ We assume that for every K > 0, there 
exists a family of sets T xy C ft 00 and constants c\,C2 > such that the 



estimates 



D (r^) > ox , 



(P™ (a;))" 2 P n {du) < c 2 , 



(3.8) 



hold for every n > 0, whenever V(x, y) < K. The integral over F^ y in ( 3.8) 
should be interpreted as the integral over the projection ofT^ y onto its n first 
components. 



A typical choice for T^ y is T* y = A y ^ or = A x ^ n A y ^ with k suffi- 
ciently large as a function of K. In this case, Lemma \2 ensures that the conditions 
required on T^ y are satisfied. As a consequence of Assumption A4, we have 



Proposition 3.4 Let Q%? y be defined as above and suppose that assumptions A.1 
and A4 hold. Then there exists for every K a constant Ck such that HQI^J > Ck, 
whenever V(x, y) < K. 



Proof. Notice first that if jjL\ and ^2 are two equivalent probability measures with 
fi2(dx) = V{x) nx{dx), then the condition 



/ CD(x)) 2 nxidx) < c 
JA 
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implies that 

(/il A imi)(A) > , 

see, e.g. QMatOlp . Recalling we use Lemma O and the above estimate with 

Hi = P n , V = T> xy , and A = T xy . Taking the limit n — > oo and using the 
assumption on proves the claim. □ 

Our last assumption will ensure that trajectories that have already coupled for 
some time have a very strong tendency to couple for all times. 

In order to formulate our assumption, we introduce a family of sets Q\{x, y), 
which are the possible final states of a "coupled" trajectory of length n, starting 
from (x, y), and never leaving the set {(a, b) \ V(a) + V(b) < K}. For a given pair 
of initial conditions (x,y) E X 2 with V(x, y) < K, we define the family of sets 
Q\{x, y) C X x X recursively in the following way: 

Q° K (x,y) = {(x,y)} , 

Q n K +1 (x, y)= |J {($ a , fc o V a ^ b )(Lu) | u G n and W a ^ b (cu) < K} . 
{a,b)eQ n K {x,y) 

Notice that we would have obtained the same sets by reversing the directions of the 
arrows in the definition. 

We also denote by V x ^ y {uj) the density of i^%^ y P relative to P. 

A5 There exist positive constants C2, 72 and such that for every K > 0, every 
(xo, yo) S X 2 with V(xo, yo) < K, and every (x,y) € Q^(xo, yo), the estimate 

f (1 - V x ^)f P(dw) < C 2 e-^ n (l + K)( , (3.9) 

JW X ^y(uj)<K 

holds for n > ( ln(l + -fQ/72- 

This assumption means that if the process couples for a time n, the density V x ^ y 
is close to 1 on an increasingly large set, and therefore the probability of coupling 
for a longer time becomes increasingly large. This assumption is sufficient for the 
family of measures (RQ n ) I!i!/ to have an exponential tail at large values of n. More 
precisely, we have 



Proposition 3.5 Let assumptions [\1[ |A2| and |A5J hold. Then, there exists a posi- 
tive constant 73 and, for every K > 0, a constant Ck such that 

\\(RQ n )x, y \\ < C K e~ rin , (3.10) 
holds for every n > 0, whenever V(x, y) < K. 
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We first show the following elementary estimate (it is not optimal, but sufficient 
for our needs): 

Lemma 3.6 Let fJ,i,fJ>2 € Mi(X) be two equivalent probability measures with 
fJ>2(dx) = V{x) (J,i(dx). Then the conditions 

l^i(A) > 1 — £1 and / (1 — T>{x)) 2 fii(dx) < E2 , 

Ja 

for some measurable set A imply that 

( m A fi 2 )(A) > 1 - ei - e 1 / 2 . 
Proof. Define the set E C X by 

E = A n {x G X I V(x) > 1} . 

We then have 



(/xi a fi 2 )(A) = m(E) + ( v( x ) m(dx) 

JA\E 

= fii(A)- f (1-V(x))fn(dx) 
JA\E 

>Hi(A)- f \l-V{x)\in{dx) 
Ja\e 



>1- £1 -JJ (1 - V{x)f iixidx) . 



IA\E 

This shows the claim. □ 



Proof of Proposition yL^ Fix the value n and the pair (x, y). For every c n > 
V(x, y) (we will fix it later), we have the estimate 

||(RQ n ),J =/,(!- Ul^yJ 3 A P||) (KQly)(dx n ,dy n ) (3.11) 
<(KQ n x , y )(X 2 \Q n Cn {x,y)) 

+ I (1 " W Xn ->y? A P||) (ir* n Ql v )(dx n ,dy n ) . 

Now choose another value w n to be fixed later and consider for every (x n , y n ) the 
set 

B n = {lj G tt I W Xn ^ n (u;) < u> n } . 
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By the definition of Q™(x,y), its elements (x n ,y n ) satisfy in particular the in- 
equality V(x n , y n ) < c n . By Assumption A.2 and the Lyapunov structure, we have 
for every (x n , y n ) G (x, y) the estimate 



P(B n ) > 1 - 



W n 



Combining this and Assumption AS with Lemma 3.6 yields 



3<n — 'Vi 



P A Pll > 1 - C^TJ " Ce" 72n/2 (1 + 

11 I/O v 

W n 



II', 



\C/2 



as long as w n is such that 

w n >c n and n > ( ln(l + w n )/l2 



(3.12) 



It remains to give an upper bound for (tt*Q™ 3/ )(X 2 \ Q™ n {x,y)) to complete our 
argument. Define the sets A n (K) cX"x X n x fl n x n n \y 

A n (K) = {{Xi^Ui^i)^ | W( Xi ,Ui) + Wiy^rn) < K} . 

It is clear by the definition of Q'c n (x, y) that we have the equality 

«Q",,)(X 2 \ Q n Cn (x, y)) = Q™ J/ (X n x X n x fi" x fi" \ , 

where Q™ is considered as a measure on X n x X n x Q n x fi n , following Re- 
mark 2.4 Since Q™ < C™ >y , we have 



«Q^)(X 2 \ Q n Cn (x, y))<l- Cl y {A n {c n )) < C 



n(V(x,y) + l) 



for some constant C. This last estimate is obtained in a straightforward way, fol- 
lowing the lines of the proof of Lemma 3.2. Plugging these estimates back into 

1 n(V(x,y) + 1) , ^ c n ( ^„_ W2/1 , ... N c/2 



( |3.11| ) yields 

||(RQ n ky|| <C- 



W n 



At this point, we make use of our freedom to choose c n and w n . We set 

Cn = V(x, y) + e 7cn and w n = V(x, y) + e^ n , 
with 7 C and j w given by 



7c 



1 

2 + 28( 



72 and 7 U 



1 + <5C 



72 • 
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As a consequence, there exist for any 7 < j c some constants C and c such that 

\\(RQ n ) x J < C(l + V{x,y)) c e^ n , 

as long as n > ( ln(l + w n )/~/2- (Such a value of n can always be found, because 
the exponent 7^ is always smaller than 72/C) In order to complete the argument, 



we notice that (3.10) is trivially satisfied for small values of n because ||(RQ n )a; i2/ || 
is always smaller than 1 by definition: it suffices to choose Ck sufficiently big. 
The proof of Proposition [33] is complete. □ 

4 An Exponential Mixing Result 

This section is devoted to the proof of the main theorem of this paper. 



Theorem 4.1 Let <I> be a RDS with state space X satisfying assumptions |A1|-|A5 
Then, there exists a constant 7 > such that 



K-P^\\ L <C(1 + V(x,y))e 



for every (x,y) G X 2 and every n > 0. 



Remark 4.2 The proof of Theorem 4. 1 does not rely on assumptions 44 and 45 



directly, but on the conclusions of Propositions and |3.5[ Nevertheless, in the 
setting of stochastic differential equations, it seems to be easier to verify the as- 
sumptions rather than to show the conclusions of the propositions by other means. 



Corollary 4.3 If $ satisfies assumptions 41-A5 it possesses a unique invariant 
measure p,* and 

||R?-M*l| L <C(l + ^(x)) e -^. 

Proof of the corollary. To show the existence of the invariant measure we show 
that for any given initial condition x with V(x) < 00, the sequence of measures P™ 
is a Cauchy sequence in the norm || • || L . We have indeed 

\K-Pl l+k L= sup / g(z)(R? - PZ +k ){dz) 
lkll,<l^X 



sup / / <7(2)(P™ - P n )(efc) P*(d|/) 

q\\,<lJxJX 



< J x \K - PyL K(dy) < Ce^ n y x (l + V(x,y)) P k x {dy) 
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< Ce~~< n {l + V{x)) , 

where we used the Lyapunov structure to get the last inequality. 

The claim now follows immediately from the theorem, noticing that if //* is an 
invariant measure for <£, then 

h 

V(x) fjLxidx) < 



1-a ' 



due to the Lyapunov structure and the fact that the dynamics immediately leaves 
the set V^ioo). □ 

Before we turn to the proof of Theorem |4.1| , we introduce some notations and 
make a few remarks. By iterating (E^), one sees that 



C~ =Q^ + E( c °° R Q n W' (4-D 

n=0 



where the symbol (C°°RQ n ) :r . is to be interpreted in the sense of (|2.l|). This 



expression is the equivalent, in our setting, of Lemma 2.1 in [ |Mat01| ]. Using ( |4. 1[ ), 
the Markov chain ^ can be described by means of another Markov chain T on 
Y = (X 2 x N) U {*}, where * corresponds to "coupling for all times" in the sense 
of Section \1.3\ First, we define 

46 

K = - , K = {(x, y) | V(x, y) < K } , (4.2) 

1 — a 

where a and b are the constants appearing in the Lyapunov condition. This set is 
chosen in such a way that 

/ V(x, y) C xo yo (dx, dy) < ^-^V(x , yo) , V (x , yo) £ K . (4.3) 
JXxX 2 

At time 0, T is located at (x, y, 0). If it is located at (x, y, n) and (x, y) Kq, 
then it makes one step according to C X:V and n is incremented by one: 

If T is located at (x, y, n) and (x, y) E Kq, then it has a probability || of jump- 
ing to * and a probability ||(RQ m )a; lJ( || of making m steps according to (RQ m ) x ,y- 

oo 

P (x,y,n) = \\Q™y\\$* + ^2 7r m( R Q m )^,y X <Wm+l ■ 
m=0 
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If T is located at *, it remains there: 

P* = s* . 

The Markov chain T induces a family of probability measures on Y°°. Let 
f : Y°° — > N U {00} be the function that associates to a sequence of elements in 
Y the largest value of n that is reached by the sequence (f = if the sequence is 
equal to * repeated). We also define k : Y°° -^NU {00} as the value of n attained 
at the first non-vanishing time when the sequence hits the set Kq x N (k = 00 
if this set is never reached). The construction of T is very close to the coupling 
construction of [MatOlp. 

is 



The crucial observation for the proof of Theorem |4T 



Lemma 4.4 Let & be a RDS with state space X satisfying assumptions Al and A3 
and let T be defined as above. Then, there exists a constant C such that 

IIP; " P, n || L < > n/2}) + Ce-W2 > 

for every (x,y) G X 2 and every n > 0. 



Proof. Recall the Markov chain ^ defined in Remark 2.5. We define a function t\ 
on its pathspace by 

n : X°° x X°° x {0, 1}°°^NU {00} 

{(xi,yi, 6i)}~! i-> inf{n [ (x n , y n ) G K Q and k = 1 Vi > n} . 

Combining ( |2.2| ) with Assumption and the definition of t\, one sees that 
IIP" " P;iL < C~ ({n > n/2}) + Ce"^ 2 . 

From the construction of T and the definition of f, we see furthermore that the 
probability distributions of t\ under C^°„ and of f under P£° are the same. □ 



Proof of Theorem JO) It remains to show that P^ y ({f > n/2}) has an exponential 
tail. The key observation is the following. Let i„eNU{- 00} with n > be a 
Markov chain defined by 

(—00 with probability p*, 
Xn = , x n+ \ = < . . , , ... 

l_ x n + m with probability p m , 

where rn > 1 and, of course, + X)m=i = 1- 

Lemma 4.5 If the p m have an exponential tail and we define r = max n x n , then 
the probability distribution of t also has an exponential tail. 
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Proof. The claim is an easy consequence of Kendall's theorem, but for the sake 
of completeness, and because the proof is quite elegant, we outline it here. Define 
the analytic function p(Q = J2m=iPmC m an d define q n as the probability of r 
being equal to n. Notice that, because of the exponential tail, p is analytic in a disk 
of radius strictly bigger than 1 around the origin. A straightforward computation 
shows that qo = p* and, for n > 0, 

In = P*(pn + Y PkiPk 2 + PkiPk 2 Pk 3 + ■ 

ki+k2=n ki+k2+ks=n 

which is equal to the nth Taylor coefficient of the function 

P* 



q(0 



i-p(0 



Since p(l) = 1— p± < 1, there exists an e > such that p(l + e) < 1. Furthermore, 
since the p n are all positive, one has the estimate |p(0| < p(|C|)- Using Cauchy's 
formula on a circle of radius 1 + e, one gets 

i ^ P* 1 



1-P0- + e) (1 + e) n ' 
which shows the claim. □ 



Before we prove Theorem in full generality, we restrict ourselves to the case 
when (x,y) € Kq. It follows from the construction that f (seen as a random 
variable under the distribution induced by P£°„) is dominated by the process x n 
constructed above with the tail distribution of the p m being equal to 

p m = sup_ P% y ({k = m}) . 

(x,y)£Ka 

This means that we define as 

oo 

m* = infjm | ^ p n < l| , 

n=m 

and then set p m = p m for m > m*, p m = for m < m* — 1, and p m ,-i in such 
a way that the p n sum up to 1. 

Because of Lemma \i.5\ it suffices to show that the tail distribution of the p rn 
decays exponentially. We thus estimate the quantity P£°y({K > n}). To this end, 
we introduce the function : X°° x X°° — > N U {oo} defined by 

T^ix, y) = inf{n > | (x n , y n ) € K } . 
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Notice that, in order to have k > n, there are two possibilities. Either the first 
step of T is taken according to (RQ m ) x ,y with some m > n/2, or the correspond- 
ing realization of ^ stays outside of Kq for a time longer than n/2. This yields the 
estimate 

oo 

P£„({k >n})< Yl IKRQ m kJ + j SU P _ C S°o,«>({ r * > ™/2}) , 

m=n/2 (^0,S/o)G^O 



holding for (x, y) G Kq. The first term has an exponential tail by Proposition 3.5 



The second term has also an exponential tail by (fL3|) and standard Lyapunov tech- 



niques (see e.g. [ |MT94| , Thm 15.2.5]). This concludes the proof of Theorem |4T 
for the case (x,y) G Kq. 

In order to conclude the proof for the case (x, y) Kq, notice that 



oc 



V%y({* > n}) < E C ^({ T * = m » SU P . P^o« f >n~m}) 
m=l (a;o,yo)e-ft"o 

oo 

< - sup P~, yo ({f > n/2}) + E C- ({r* = m}) . 

(^0,yo)GKo m=n/2 

The first term is bounded by the construction above. The Lyapunov structure im- 
plies that there exists a constant 7 > such that the first hitting time satisfies 
^(x,y)e 7r * = C(V(x, y)) for every (x, y) G X 2 (see again [ |MT94l Thm. 15.2.5]). 



This allows to bound the second term and concludes the proof of Theorem |4.1[ □ 

5 Application to Stochastic Differential Equations 

In this section, we will see how to apply Theorem ^T] to the case when the RDS <I> is 
constructed by sampling the solution of a (possibly infinite-dimensional) stochastic 
differential equation. We will restrict ourselves to the case where the equation is 
driven by additive white noise. The case of multiplicative noise requires further 
estimates, but can also be described by the formalism exposed here. 
Consider the equation described by 

dx(t) = Axdt + F(x) dt + Q dw(t) , x(0) = x , (5.1) 

where x belongs to some separable Hilbert space H, u> is the cylindrical Wiener 
process on some separable Hilbert space W, and A, F and Q satisfy the following 
assumptions: 

Bl a. The linear operator A : D(A) — > Tt is the generator of a strongly continu- 
ous semigroup on TL. 
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b. The operator e Q : W — > TL is Hilbert-Schmidt for every t > and one 
has the estimate 

I \\e M Q\\l s dt < oo . (5.2) 
Jo 

c. The nonlinear operator F : D(F) — > TL is such that, for every xq £ TL, 
there exists a unique, continuous stochastic process x(t) such that x(s) £ 
B(F)for s > and 

x (f) = e At x + /* e A{t - s) F{x(s)) ds + f e A(t ' s) Q duj(s) , (5.3) 
J o Jo 

for every t > 0. 



Remark 5.1 This assumptions simply states that there exists a unique weak so- 
lution to (|Q1). Notice that we do not make any assumptions on the tightness of 



the transition probabilities for (5.1). As a consequence, existence and uniqueness 



results for invariant measures can in principle be deduced from Theorem p.5| below 



even in cases where the semigroup e At is not compact. 



In order to recover the formalism used in Section 0, we follow [DPZ92b] and 



introduce an auxiliary Hilbert space W such that there exists a continuous embed- 
ding i : W ^ W, which is Hilbert-Schmidt. We can now set Q = C ([0, 1], VV), 
the space of continuous >V-valued functions that vanish at 0, and define P as the 
Wiener measure on VV with covariance operator a* . 

We define $ : H x Q, — > H as the map that solves (5.3) up to time 1 given 
an initial condition and a realization of the noise. This map is defined P-almost 
everywhere on Q,. We also denote by <I> t : H x — > TL the map that maps an 



initial condition and a realization of the noise onto the solution of ( |5.3| ) after a time 
t. 

Our next assumption is the existence of an appropriate Lyapunov function V: 



B2 There exists a measurable function V :TL — > [0, oo] and constants a < 1 and 
b,c,d>0 such that 

E w V($(x,cj)) < aV(x) + b, 

E w ( sup V($ t (x, u)j) < cV(x) + d , (5.4) 
v o<t<i ' 

P({uj | V($(z,w)) = oo}) = 0, 

for every x G TL. Furthermore, V dominates the norm in TL in the sense that 
\\x\\ < C(l + V(x))for some constant C. 
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As is Section ^, we define V(x, y) = V{x) + V(y). 

Remark 5.2 Take TL equal to l3(0) for some regular bounded domain O C R rf , 
A a second-order elliptic differential operator on O with sufficiently smooth coef- 
ficients, and F any polynomial non-linearity of odd degree having the correct sign. 



The assumptions Bl and [B2| are satisfied with V(x) = \ \x ||^ for every power p > 1 
and every "reasonable" norm || • ||^, as long as Q is "small" enough. (One can for 
example take for || ■ ||* the L°° norm or a Sobolev norm.) 



We now turn to the binding construction for the problem (p. 1|). Take a function 



G : TL 2 — ► W and consider the 7^ 2 -valued process (x, y) solving 



dx(t) = Axdt + Fix) dt + Q dio(t) , (5.5a) 



dyit) = Aydt + F(y) dt + Q G(x, y)dt + Q duj{t) . (5.5b) 



Notice that the realization of oj is the same for both components. The process 
(5.5) yields our binding construction for (|5~1|). In order to give sense to (^b), we 



introduce the H-valued process git) = y(t) — x(t) and we define it pathwise as the 
solution of the ordinary differential equation 

q = Ag + Fix + g) - F(x) + Q G(x, x + g) . (5.6) 

We assume that G is sufficiently regular to ensure the existence and uniqueness 



of global weak solutions to (5.6) for almost every (with respect to the measure 
on pathspace induced by <I> t ) continuous function x : [0, oo) — > TL. This allows 
us to define the stochastic process yit) by y(t) = x(t) + git). We will denote 
by : X x X x O - >X the map that solves (|3|b) up to time t, given an initial 
condition for x and y, and a realization of the noise. 

The above construction is invertible in the following sense. Consider the H 2 - 
valued process 

dxit) = Axdt + Fix) dt-Q G(x, y)dt + Q d&it) , fT^a) 
dyit) = Aydt + F(y) dt + Q dCo{t) , (|/7|b) 

where we give sense to the equation for x as above by setting g = y — x and solving 

& = Ag + Fiy)- Fiy- g) + QGiy- g,y) . 



We denote by$t:XxXxil^X the map that solves (5.7 a) up to time t, given 



an initial condition for x and y, and a realization of the noise Q E f2. We see that 



(5.7) can be obtained from (p3h by the substitution dto = du + Gix,y)dt and 



Application to Stochastic Differential Equations 



27 



a renaming of the variables. This observation yields the invertibility of the maps 
ift x ->y defined in Eq. ( 5.12 ) below. 



We will state two more assumptions to make sure that the conclusions of The- 



orem |4JJhold. First, we want G to become small as x and y become close. 
B3 There exists a constant C > and exponents a, /3 > such that 

\\G(x, y)\\ 2 < C\\x - y\\ a (l + V(x, y)f , (5.8) 
for every x,y € H. 

The last assumption ensures that the process y(t) converges towards x(t) for large 
times. 



B4 There exist positive constants C and 7 such that the solutions of (5.5) and (5.7) 
satisfy 

\\$ t (x,u)-$t(x,y,uj)\\ <Ce-^(l + V(y) + supV($ s (x,u))) , (^a) 

v s<t ' 



\$ t (x,y,uj)-$ t ty,u)\\ <Ce-* (1 + V(x) + supV(<S> s (y,u)) ), (^b) 

v s<t ' 



for P°° -almost every lo € Furthermore, there exists 5 > such that one 
has the estimate 



V{$ t (x, y, w)) < C(l + V(y) + sup V($ s (x, w))) , @a) 

V s<t ' 

V($ t (x,y,u)) <c(l + V(x) + supV($ s (y,u))) 5 , (gT^b) 

v s<t 



for P°° -almost every lo £ Q°° a«cf every t > 0. 



Remark 5.3 One important particular case is the choice V(x) = \\x\\ p , where the 



power p is chosen in such a way that ( |5.9[ ) is satisfied. Notice that in this case, the 



estimates ( p.lO| ) are a straightforward consequence of (p.9p. 

The function G is then only required to satisfy a bound of the type 

\\G(x,y)\\ 2 <C\\x-yr(l + \\x\\ + \\y\\) q , 

with a and q some arbitrary positive exponents. 

It is also possible to choose V(x) = \\x\\*, with || • ||* the norm of some Banach 



space B C Tt. In this case, (5.9) with the ^-norm replacing the H-norm in the 



left-hand side implies (5.10) 
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Remark 5.4 An equivalent way of writing (|5^9|b) is 



\$ t (x,uj)-$ t (x,y,u)\\ < Ce 7i (l + V(x) + supV($ s (x,y,iv)) 

K s<t 



(5.11) 



The equation ( 5.11 ) will be more natural in our examples, but (5.9) is more sym- 
metric and more convenient for the proof of Theorem W3 below. 



All these assumptions together ensure that exponential mixing takes place: 



Theorem 5.5 Let A, F and Q be such that assumptions 
If there exists a function G : TC 2 — > VV such that assumptions B3 and 



and |B2| are satisfied. 

hold, 



then the solution of (5.1) possesses a unique invariant measure /x* and there exist 
constants C, 7 > such that 

\K-fi*\\L<Ce-^(l + V(x)) . 



Proof. It suffices to show that assumptions |A1|-|A5| hold. Assumption follows 
immediately from Assumption B2. In order to check the other assumptions, we 
define the various objects appearing in the previous sections. We have already seen 
that X = HM = C ([0, 1], VV), and $ is the solution of (|j]) at time 1. 
We define the function W :TL x fl — > [1, 00] by 

W{x,uj)= sup V($t(x,v)) ■ 
te[o,i] 



The estimate ( p.4| ) and the definition ensure that W satisfies (|3.3|a) and (p.3|b). The 
bound (5.1C) ensures that Assumption A.2 is also satisfied. 

It remains to define the binding functions tp x ^ y and to compute the densities 
T>™ y . According to the constructions ( |53| ) and ( |5/7| ), we define for (x, y) G H 2 the 
binding functions 



(ip x ^ y (uj))(t) = u(t) + / G($ s (x,u)),$ s (x,y,uj))ds , 



(5.12a) 



{i) x ^ y (u)){t) = u(t) - / G(<f> s {x,y,uj),<S> s (y,u}))ds , (5.12b) 
Jo 

with t € [0, 1]. It follows from the construction that these functions are each 
other's inverse. Furthermore, if we identify Q n with Co([0, n], VV) in a natural way, 
we see that the maps H™ y introduced in ( 2.10 ) are obtained from ( 5.12| ) by simply 
letting t take values in [0, n]. These observations allow us to compute the densities 
V™ y (u>) by Girsanov's theorem: 
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Lemma 5.6 The family of densities T>™ JuS) is given by 

D^(o;) = exp(y G($ t (x,y,u),<f> t (y,uj))clw(t)--j \\G(. . .)|| 2 dt) , 
where the arguments of G in the second term are the same as in the first term. 

Proof. If we can show that Girsanov's theorem applies to our situation, then it 
precisely states that 

with P n (doo) = V™ y {uj) P n (dco), and V™ y (oj) defined as above. Applying rp*^ y to 
both sides of the equality shows the result. 



We now show that Girsanov's theorem can indeed be applied. By [DPZ92b 
Thm 10.14], it suffices to verify that 



/ 

Jo? 



Vl^) P n {dUj) = l 



(5.13) 



This can be achieved by a suitable cut-off procedure. Define for N > the function 

Gn(x, y) ■■ 



G(x,y) if \\G(x,y)\\ <N, 
otherwise, 



and define 



V%?{w) = exp(j G N ($ t (x,y,u;),$ t (y,uj))duj(t)--j o \\G N (. . .) 



dt 



It is immediate that (5.13) holds for T>™y- Furthermore, it follows from Assump- 



tion 



B4 that there exists a constant Cn such that V™> y (u>) = V™ y (uj) on the set 



T N = {uj <e V n | V($ s (x, y, u>), ^ s {y, to)) < C N Vs G [0, n]} . 

The sets T N satisfy lim^oo P n (lV) = 1 by and tfxWfc). This shows that 
( 5.13 ) holds. Notice that the a-priori bounds of Assumption [B4| were crucial in 
this step in order to apply Girsanov's theorem. The bound (^8) alone could lead to 
exploding solutions for which Girsanov's theorem does not apply. □ 



It is immediate that Assumption |A3| follows from Assumption |B4| and the definition 
of the norm II • || T . 



We now turn to the verification of Assumption |A4j . Recalling the definition 
we see that 



(3.5), we see that in our case 

{uj € | V($ S (V, w)) < k(V(y) + s 2 ) Vs > 0} 
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As we see from the definition of B y k , a natural definition for a truncation G y ^ of 
G (this time the truncation additionally depends on time) is 

G y , k (x,y,t) = \ G ^ ifV(y)<k(V(y) + n 
{ otherwise. 

As above, we define 

P^V) = exp(y o G yjk ($t(x,y,uj),<j>t(y,u),t) du(t)-- J \\G y>k (. . .)f dt) . 

By definition, T>™' y (uj) = V™ y {u)) for every u> G B yk . Setting £ = 5(a + jJ), we 
thus have the estimate 

/ {Vl y {u))~ 2 P n (dw) < f (V^{u)y 2 P n (du) 

J^y,k JBy,k 

' j B ex?(l0 £ \\G y , k ($ t (x,y,u;),$ t (y,u),t)\\ 2 dt) P n {du) 

exp (10 r Ce'^ (l + V(x) + sup V($ s (y, u))) ? dt] P n (duj) 
<y, k V Jo v s<t ' J 

< z*p(c£ e~ 7 *(l + kV(x, y) + kt 2 ^ dt) . 

In this expression, we used the Cauchy-Schwarz inequality to go from the first to 
the second line, and we used assumptions B3 and |B4| to go from the second to the 
third line. Since the integral converges for n — > oo, the bound is uniform in n and 
Assumption A4 is verified. 



< 



< 



1/2 

1/2 



The verification of Assumption \5 is quite similar. Fix some positive constant 
K > and use again the cutoff function 

G N (x,y) = l G ^ '*\\Gi*,V)\\ 2 <N. 
1 otherwise. 

The precise value of N (as a function of K) will be fixed later. We also fix a 
pair (xo,yo) G with V"(xo>J/o) < K, a value n > 0, and initial conditions 
(x,y) <G Q^(xo,yo)- By the definition of Q^ixQ^o), there exists an element 
uj £ Q n such that 

(x,y) = (® n (xa,y ,oj),$ n (yo,ti)) , (5.14) 

and such that 

sup V(<& s {x , y , cD), & s {yo, u)) < K . (5.15) 

se[0,n] 
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Following the statement of Assumption A.5, we define the set 



D 



K 



sup V($ t (x,y,Lj),<S> t (y,uj)) < k\ , 
*e[0,i] 



which is equal in our setup to the set over which integration is performed in (3.9). 
Being now accustomed to these truncation procedures, we define again 

V% y \u) = exp(jf * G N ($ t (x,y,u), $ t (.y,u)) dw(f) - i jf \\G N (. . .)|| 2 dt) . 

By ( 5.14 ) and the cocycle property, we can write the integral in the above expres- 
sion as 

l ^ 

G N ($ n+ t(x , y , uu), $ n+ t(y ,ujuj)) duj{t) , 

where ljuj is the realization of the noise which is equal to Co for a time n and then 
to to for a time 1. Using the a-priori bound (5.15) as well as assumptions B3 and 
153, we thus see that there exists a constant C such that the choice 



o 



N = Ce- a ~ fn (l + K) 



a+f3 



ensures that T>^ y \u>) is equal to T> Xty (u) for u € 



K 



We then have the estimate 



/ (1 - V x , y {uj)f P(dw) < [ (1 - V ( ^{oj)) 2 P{dw) 
= ( (V { x K ^)f P{dto) - 1 

J i2 

< ^exp(6^ 1 \\G N ($ t (x,y,Lo),<S> t (y,Lo))\\ 2 dt) P{dco) 



1/2 



< exp(Ce- a7n (l + K) 



1 . 



If we take n > /31n(l + K)/~f, the exponent is bounded by C and there exists a 
constant C' such that 

/ (1 - V x , y {oj)f P{du) < C'e-^ n {\ + KT+P , 
JBff,, 



thus validating Assumption |A5| with 72 = 07 and Q = a + (5. 
The proof of Theorem 5.5 is complete. 



□ 
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6 Examples 



Numerous recent results show that the invariant measure for the 2D Navier-Stokes 
equation (and also for other dissipative PDEs) is unique if a sufficient number of 
low-frequency modes are forced by the noise OBKLOOaj [BKL00b| , |EMS01| , pVlatOl 



ELOTL |KSO0| , |KS0T| , fvlY01[ ]. These results are not covered directly by Theo- 
rem 5.5|, but some more work is needed. The reason is that the sets A% defined in 



(3.5) are not the natural sets that allow to control the influence of the low-frequency 
modes onto the high-frequency modes in the 2D Navier-Stokes equation. 



On the other hand, our formulation of Theorem makes it quite easy to 
verify that the n-dimensional Ginzburg-Landau equation (in a bounded domain) 
shows exponential mixing properties, if sufficiently many low-frequency modes 
are forced by the noise. We verify this in the following subsection. 

6.1 The Ginzburg-Landau equation 

We consider the SPDE given by 

du = (An + u-u i )dt + Q dw(t) , u(0) = u , (6.1) 

where the function u belongs to the Hilbert space 

n = L 2 ([-L,L] n ,R) , 

and A denotes the Laplacean with periodic boundary conditions. The symbol 
Q du{t) stands as a shorthand for 

N 



Q duj(t) = qi&i diOiit) 



i=i 



where {qi]f =l is a collection of strictly positive numbers, denotes the ith eigen- 
vector of the Laplacean, and the Ui are N independent Brownian motions (for some 
finite integer N). We also denote by A, the eigenvalue of A corresponding to ei 
and we assume that they are ordered by . . . < A2 < Ai < 0. We will see that it 
is fairly easy to construct a binding function G for which the assumptions of the 
previous section hold with V(u) = \\u\\, where || • || denotes the norm of H. 



In [ pPZ96| ], it is shown that ( p.l[ ) possesses a unique mild solution for initial 
conditions uq G L°°([— L, L] n ). It is straightforward to extend this to every initial 
condition uq G Tl, by using the regularizing properties of the heat semigroup. 



Thus, Assumption Bl holds and we denote by P* the transition probabilities of the 
solution at time t starting from u. We have the following result: 
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Theorem 6.1 There exist positive constants C and 7, and a unique measure /z* € 
Mi (Tt) such that 

||P*-^ L <C f e-T''(l + |M|), (6.2) 
for every u GTC and every t > 0. 



Proof. We verify that the assumptions of Theorem 5.5 hold. The bounds required 
for the verification of Assumption B2 can be found in [ Cer99 , DPZ96 ], for exam- 
ple. 

It remains to construct the forcing G : Tt 2 — > and to verify assumptions B3 
and B4. We consider two copies u\ and u 2 of (|6T|), with the noise du replaced 
by dw + G dt in the second copy. We also denote by g = u 2 — u% the difference 
process. It satisfies the differential equation 



g = Ag + g - q(u\ + u\U2 + u\) +Q G(ui,u 2 ) ■ 



(6.3) 



We can project (6.3) onto the direction given by e^. This yields 



(A fc + l)g k - (o(u\ + uiu 2 + u| 



Qk 

for k = 1, . . . , N and 

Qk = (Afc + V)Qk - (q{u\ + «l«2 + 
for k > N. We choose Gk for = 1, . . . , N as 

2 + A fc 



+ g fc G fc (ni,u 2 ) 



"2! 



G fc (-ui,u 2 ) 



Qk 



-Qk 



Since can only be defined this way if qk 7^ 0, we use at this point the fact that 
the noise acts directly and independently on every unstable mode. This requirement 
can be significantly weakened with the help of Theorem |5.5[ We will focus next 
on more degenerate problems which illustrate the power of our technique. 

This choice satisfies Assumption B3 . With this choice, we can write down the 
evolution of the norm of g as 



d\\g\\ 
dt 



2(g, Ag) - 2(g, g{u\ + uiu 2 + u|)) , 



with A the linear operator given by adding up the contribution of A + 1 and the 
contribution of G. By the condition we imposed on N, there exists a constant 
a > such that (g, Ag) < — a|[^|| 2 . Furthermore, one has 



{g, g{u\ + uiu 2 + u 2 2 )) > 
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We thus have the differential inequality 

< -2a||£>|| z , 



d \\e\\ 2 ^ „...n. ,,2 



dt 

which implies that 

11^)11 <e- a *||^(0)|| . 
This implies by Remark B3] that Assumption 53 is also satisfied. The proof of 



Theorem 6.1 is complete. □ 



6.2 A reaction-diffusion system 

Consider the following reaction-diffusion system: 

du = (An + 2u + v - u 3 ) dt + dw(t) , 

„ (6.4) 
dv = (Av + 2v + u-v 6 )dt, 

where the pair (u, v) belongs to the Hilbert space 

H = H U ®H V =L 2 ([-L,L],R)0L 2 ([-L,L],R) . 

The symbol A again denotes the Laplacean with periodic boundary conditions, 
and du is the cylindrical Wiener process on TC U (meaning that it is space-time 
white noise). 

Notice that, because of the presence of v, this system does not satisfy the as- 
sumptions stated in the papers mentioned at the beginning of this section. In other 
words, even though the forcing is infinite-dimensional, not all the determining 



modes for (6.4) are forced. 



We take as our Lyapunov function 

V(U,V) = \\u\loo + Halloo , 

with || • | |oo the L°° norm. As in the previous subsection, one can show that with this 
choice of V, our problem satisfies assumptions pT and B3. We will now construct 



a binding function G which satisfies assumptions |B4| and g3j. We consider, as in 
( |5.5[ ), two copies (ui, v\) and (1*2,^2) of the system (6.4), but the noise is modified 
by G on the second copy. We also define g u = 112 — u\ and g v = V2 — v \. We then 
have 

Q u = AQ U + 2Q U + Q v - Q u {u\ + U X U 2 + u\) + G(u%, U 2 , V I , V 2 ) , ,. 

2 2 

Q v = Ag v + 2g v + q u - g v (v 1 + VlV 2 + v 2 ) . 
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Our construction of G is inspired from the construction we presented in Section [P|. 
We introduce the variable ( = g u + 3g v . Substituting this in ( |6.5[ ), it defines the 
function G if we impose that the equation for ( becomes 



C = AC-C, 



(6.6) 



so that ||C(*)|| 2 < llC(0)|[ 2 e *. Notice that the function G achieving this identity 
satisfies a bound of the type 

||G|| < + ||^||)(1 + H^llloo + IMloo + Hoc + ll^lloo) 2 , 



thus satisfying Assumption [B3| . It remains to show that Assumption |B4| is satisfied. 
The equation for g v reads 

q v = Ag v - q v + (- g v (vf + viv 2 + vl) . 

Therefore, the norm of g v satisfies 



\Qv(t)f < \\Qv(0) 



, e - t+ i+\m\\\- t 



This in turn implies, through the definition of ( and the bound on ||C(0||> that a 



similar bound holds for ||g u (t)||. This shows that the bound (5.S) is satisfied. Simi- 
lar estimates hold with the L°° norm replacing the L? norm, and so Assumption B4 



is satisfied by Remark 5.3 



In fact, a straightforward computation, which can be found in [ |Cer99| , |Hai01 



MS01 ] for example, shows that in this example, one can get a uniform estimate on 
the Lyapunov function V. More precisely, there exists a constant C such that for 
all initial conditions x G H, 



f V(y)P x (dy)<C 
Jh 



(6.7) 



Denoting by VI the semigroup acting on measures generated by the solutions of 
x4|), we thus have: 



Theorem 6.2 There exists a unique probability measure £ Mi (Tt) such that 
Vtfi* = n*for every t > 0. Furthermore, there exist constants C and 7 such that 



(6.8) 



for every u € Mi(H). 
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Proof. Combining (6.7) with the results of Theorem 4.1 
to what was done in the proof of Corollary 



and a computation similar 



\ we get ( |6.8[ ) for integer times. The 
generalization to arbitrary times is straightforward, using the fact that the growth 
rate of the difference process (g u , g v ) (with G = 0) can easily be controlled. □ 



Remark 6.3 In fact, the dependence on u in the right-hand side of (6.2) can be 



removed similarly by checking that an estimate of the type ( J6-7| ) is verified for the 



solutions of the stochastic Ginzburg-Landau equation (6.1). 



6.3 A chain with nearest-neighbour interactions 

In the previous example, the noise acted on infinitely many degrees of freedom in 
a non-degenerate way. As a consequence, one step was sufficient to transmit the 
noise to the entire system. We will now look at a much more degenerate system, 
where the noise acts on only one degree of freedom, although an arbitrary number 
of modes are linearly unstable. 
Our model is given by 



cLxq = (a 2 XQ + x\ — Xq) dt + doj , 



x k = (a -k )x k + + x k 



+i 



4 



1,2,... , 



(6.9) 



where a £ R is an arbitrary constant. One should think of the deterministic part 



of (5.9) as a very simple model for a dissipative PDE of the Ginzburg-Landau 
type. We will consider in the (real) Hilbert space TL = I 2 endowed with its 
canonical orthonormal basis {e^j^Q. It is easy to verify that (5.9) possesses a 
unique solution. We denote again by V* the semigroup acting on measures v G 
M(£ 2 ) generated by We will show 



Theorem 6.4 For the problem ( |6.9p , there exists a unique probability measure 
p>* £ Mi(£ 2 ) such that T^/u* = [i* for every t > 0. Furthermore, there exist 
constants C and 7 such that 



\V* t v 



for every v £ Mi(7i). 



Proof. We will take as our Lyapunov function V(x) = \\x\\ p for some power of p 
to be fixed later. It is a straightforward task to verify that the dynamics generated 
by ( |6.9[ ) does indeed satisfy assumptions Bl and [B2| for this choice of V . 



We next show that a bound of the type (5.7) holds for the solutions of (6.9), 
thus yielding the uniformity in the convergence towards the invariant measure 
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Let us define the process y(t) € I 2 by y(t) = x(t) — co(t)eo- This process then 
satisfies the following system of differential equations: 

i/o = a 2 (y +u) + yi- (yo + w) 3 , 

yi = (a 2 -l)yi + y + y 2 -yf + uj, (6.10) 
Vk = (a 2 ~ k 2 )y k + yk-i + yk+i - y\ , k = 2,3,... . 



We denote by \\y\\oo the norm given by sup fc \y k \. It follows from [Lun95] that 



( |6.1C ) possesses a strong solution for positive times. Furthermore, from ( 6.1C ) and 



the definition of the || • ||oo-norm, we see that there are constants c\,ci > such 
that 

D ~Pt°° < -cilll/HL + c 2 (l + Ht)\ 3 ) , (6.11) 

where D^/Dt denotes the left-handed lower Dini derivative. A straightforward 
computation shows that ( |6.11| ) implies that there exists a constant C such such that 

||l/(l/2)||oo < C sup (1 + \uo{t)\) , 
te[0,i/2] 

independently of the initial condition. In order to conclude the proof of the estimate 
(|6.7|), it suffices to show that there exists a constant C such that 



E||y(l/2)|| < C(l + Halloo) . 
This follows easily from the dissipativity of the nonlinearity in H and the fact that 



the semigroup generated by the linear part of (6.10) is bounded from £°° into I 2 . 
It remains to verify that the assumptions Bl — are indeed satisfied for some 



binding function G. This, together with the uniform bound obtained above, shows 



that the conclusions of Theorem [^J hold. As for the toy model presented in Sec- 
tion |lTT| , we consider a process y G i 2 governed by the same equation as ( |6.9[ ), but 
with duo replaced by du> + G(x, y) dt. We then introduce the difference process 
g = y — x, which is given by the solution of 

Qo = a 2 g + qi - Qu{xl + x y + y%) + G(x, y) , 

2 2 2 2 (6-lA> 

Qk = (a -k )g k + Qk+i + Qk-l ~ Qk(x k + x k y k + y k ) . 

The aim of the game is to find a function G for which g(t) — > as t — > oo. We can 
split ( |6.12 ) into "low modes" and "high modes" by introducing 



K = mf{k > | k 2 - a 2 > 3} . 

At the level of the Hilbert space £ 2 , we set I 2 = Hl®'Hh, where Hl ~ R fc * 
is generated by eo, • • • , ek*-l an d T^H is its orthogonal complement. We denote 
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by ql and qh the components of g and by Ah the restriction (as a symmetric 
quadratic form) of the linear part of ( |6.9| ) to Tin- It is by construction easy to see 
that 

(qh,A h q h ) > \\qh\\ 2 ■ 
As a consequence, we have for ||^h|| 2 the following estimate: 

\\QH(t)f < e-*||^(0)|| 2 + ] [ l e^ldisrf ds , (6.13) 

4 Jo 

where we defined (j = Qk,-i- (The reason for renaming Qk,-\ this way will 
become clear immediately.) It remains to construct G in such a way to get good 
estimates on ||^L(t)|| 2 . In order to achieve this, we use again the same method as 
for the first toy model. The variable (j obeys the equation 

Cl = C\Q k ,-i + 0fc„ + Qk,~2 ~ 0fc.-l(£fc,-l + £fc.-l2/fc.-l + 2/fc,-l) ' 

with some constant c\ € R. We thus introduce a new variable £2 defined by 

C2 = (ci + 1)^-1 + Qk, + Qk,-2 - e'fe.-iC^L-i + Sfc»-iyfc.-i + yl»-i> ■ 

It is important to notice two facts about this definition. The first is that it yields for 
|Ci 1 2 the estimate 

\d(t)\ 2 < e-*|Ci(0)| 2 + ~ f e l - s \C2{s)\ 2 ds . (6.14) 
4 Jo 

The second is that (2 can be written in the form 

C2 = QK-2 + Q2(Q,x,y) , 

where Q2 is a polynomial depending only on components qi, xi and with i > 
k* — 1, and such that each of its terms contains at least one factor qi. 

Now look at the equation for (2- It is clear from the structure of £2 and from 
the structure of the equations (5.9) and (5.12) that it can be written as 

C2 = — C2 + C3 , 

where 

C3 = Qk„ -3 + Qz(Q,x,y) . 

This time, the polynomial Q3 depends only on components with an index i > 
— 2. This procedure can be iterated, yielding a whole family of variables 

= Qk*-i + Qi(QiX,y) , (6.15) 
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where the Qi are polynomials depending only on indices i > k* — I + 1, and 
containing at least one factor Qi in each term. Furthermore, one gets for every Q 
the estimate 

i r-t 

|2 



\(i(t)\ 2 < e~*|0(0)| 2 + \J e*- s |0+i(s)| z ^ 
Notice that ( 6.16| ) is valid for I < k*. For I = k*, we have 

Cfc. = Qk*+i(Q, x, y) + G(x, y) . 



(6.16) 



(6.17) 

It thus suffices to choose G in such a way that ( |6. 17| ) becomes 

Cfc, = -C*. • (6-18) 

Denoting by £ the vector Cl> ■ ■ ■ ) Cfc*> we § et from ( |6.16 ) and ( 5.18 ) the estimate 

IICWII 2 < Ce-^||C(0)|| 2 , (6.19) 

for any 7 € (0, 1). Plugging this into ( |6.13 ) yields for \\qh\\ the estimate 

||^(i)|| 2 <C e -^(||to(0)|| 2 + ||C(0)|| 2 ) 

<Ce^||^(0)|| 2 (l + ||x(0)|| + ||y(0)||) p , 



for some constants C, 7 and p. It remains to get an estimate on \\ql\\- From ( 5.19 ) 
and the definition of (1, we get immediately 

\Qk,-i(t)\ 2 < Ce^||£(0)|| 2 (l + ||x(0)|| + ||y(0)||) p . 
From the definition of £2. we get 

|a,-2(t)| 2 < C(\( 2 (t)\ 2 + \Q 2 (Q(t),x(t),y(t))\ 2 ) . 

But we know that Q2 only depends on components of g, x, and y with an index 
i > k* — 1. These are precisely the components of g on which we already have an 
estimate. We thus get 

\Qk,-2(t)\ 2 < Ce^||e(0)|| 2 (l + K0)|| + ||y(0)|| + ||x(t)||) p , 

for some other power p. Here we used the fact that y(t) = x(t) + g(t) to get rid of 
||y(t)[| in the estimate. The same reasoning can be applied to g^-3, and so forth 
down to go. We finally get 

\\QL(t)f < Ce^ \\g(0) || 2 (1 + ||x(0)|| + ||y(0)|| + \\x(t)\\) p , (6.20) 

for some (large) power of p. We thus verified (^9|a). The bound ( |5.9| b) is obtained 
in the same way, by noticing that we can as well get the estimate 

\\QL(t)\\ 2 < Ce~^ \\g(0) || 2 (1 + ||x(0)|| + ||y(0)|| + \\y(t)\\f , 

instead of ( 5.20| ). The proof of Theorem SA is complete. □ 
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Remark 6.5 The whole construction is strongly reminiscent of what was done in 
[ EPR99| ] to control a finite Hamiltonian chain of non-linear oscillators with nearest- 



neighbour coupling driven by thermal noise at its boundaries. 

Remark 6.6 The linearity of the nearest-neighbour coupling is not essential for 
our argument. We could as well have replaced ( jx9| ) by 

(Ixq = (a 2 xo + V^ixi — xq) — V{(xq)) dt + duj , 
Xk = {a 2 - k 2 )x k + V^ixk-i - x k ) + Vz(xk+i - x k ) - V((x k ) , 

with V\ and Vi two polynomial-like functions, i.e. smooth functions such that 

d n Vi(x) _ Qi _ n 



dx n 



for a; I — > oo , 



for some on > 2. Imposing the condition V^(x) > c for some c > yields an 
effective coupling between neighbours at every point of the phase space. This is 
sufficient to apply our construction. 
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